AITopics

2511.01935

Country: Asia (0.28)

Genre:

Research Report > New Finding (0.93)
Research Report > Experimental Study (0.93)

Industry: Education (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.68)

Jauhiainen, Jussi S., Toppari, Aurora

Generative Artificial Intelligence and Agents in Research and Teaching

arXiv.org Artificial IntelligenceAug-27-2025

This study provides a comprehensive analysis of the development, functioning, and application of generative artificial intelligence (GenAI) and large language models (LLMs), with an emphasis on their implications for research and education. It traces the conceptual evolution from artificial intelligence (AI) through machine learning (ML) and deep learning (DL) to transformer architectures, which constitute the foundation of contemporary generative systems. Technical aspects, including prompting strategies, word embeddings, and probabilistic sampling methods (temperature, top-k, and top-p), are examined alongside the emergence of autonomous agents. These elements are considered in relation to both the opportunities they create and the limitations and risks they entail. The work critically evaluates the integration of GenAI across the research process, from ideation and literature review to research design, data collection, analysis, interpretation, and dissemination. While particular attention is given to geographical research, the discussion extends to wider academic contexts. A parallel strand addresses the pedagogical applications of GenAI, encompassing course and lesson design, teaching delivery, assessment, and feedback, with geography education serving as a case example. Central to the analysis are the ethical, social, and environmental challenges posed by GenAI. Issues of bias, intellectual property, governance, and accountability are assessed, alongside the ecological footprint of LLMs and emerging technological strategies for mitigation. The concluding section considers near- and long-term futures of GenAI, including scenarios of sustained adoption, regulation, and potential decline. By situating GenAI within both scholarly practice and educational contexts, the study contributes to critical debates on its transformative potential and societal responsibilities.

artificial intelligence, large language model, machine learning, (20 more...)

2508.16701

Country:

Europe (1.00)
North America > United States (0.67)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Overview (1.00)
Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Social Sector (1.00)
Leisure & Entertainment (1.00)
Law (1.00)
(14 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

Mohajeri, Kaveh, Karami, Amir

Computationally Intensive Research: Advancing a Role for Secondary Analysis of Qualitative Data

arXiv.org Artificial IntelligenceJun-6-2025

This paper draws attention to the potential of computational methods in reworking data generated in past qualitative studies. While qualitative inquiries often produce rich data through rigorous and resource-intensive processes, much of this data usually remains unused. In this paper, we first make a general case for secondary analysis of qualitative data by discussing its benefits, distinctions, and epistemological aspects. We then argue for opportunities with computationally intensive secondary analysis, highlighting the possibility of drawing on data assemblages spanning multiple contexts and timeframes to address cross-contextual and longitudinal research phenomena and questions. We propose a scheme to perform computationally intensive secondary analysis and advance ideas on how this approach can help facilitate the development of innovative research designs. Finally, we enumerate some key challenges and ongoing concerns associated with qualitative data sharing and reuse.

data mining, machine learning, qualitative data, (22 more...)

doi: 10.17705/1jais.00923

2506.0423

Country:

Europe (0.68)
North America > United States > Maryland (0.28)

Genre:

Research Report > Experimental Study (1.00)
Overview (0.93)

Industry:

Information Technology (1.00)
Health & Medicine > Therapeutic Area (1.00)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications > Social Media (1.00)
(3 more...)

arXiv.org Artificial IntelligenceJul-29-2023

Science in the Era of ChatGPT, Large Language Models and Generative AI: Challenges for Research Ethics and How to Respond

Pournaras, Evangelos

Since the release of popular large language models (LLMs) such as ChatGPT, the transformative impact of artificial intelligence (AI) on broader society has been unprecedented. This is particularly alarming for science and its conquest of truth (Chomsky et al., 2023). Generative AI and, particularly, conversational AI based on language models has set new ethical dilemmas for knowledge, epistemology and research practice. From authorship, to misinformation, biases, fairness and safety of interactions with human subjects, research ethics boards need to adapt to this new era in order to protect research integrity and set high-quality ethical standards for research conduct (van Dis et al., 2023). This paper focuses on reviewing these challenges with the aim of laying foundations for a timely and effective response. ChatGPT is an AI chatbot released in November 2022 by OpenAI. It is a Generative Pre-trained Transformer (GPT), a type of artificial deep neural network with a number of parameters in the order of billions. It is designed to process sequential input data, i.e. natural language, without labeling (self-supervised learning), but with remarkable capabilities for parallelization that significantly reduce training time. The model is further enhanced by a combination of supervised and reinforcement learning based on past conversations as well as human feedback to fine-tune the model and its responses (Stiennon et al., 2020; Gao,

large language model, machine learning, natural language, (18 more...)

2305.15299

Country:

Europe > United Kingdom > England > West Yorkshire > Leeds (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > Experimental Study (0.68)

Industry:

Health & Medicine (0.93)
Information Technology > Security & Privacy (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.98)

#artificialintelligenceOct-5-2021, 13:10:34 GMT

The different tribes of data scientists

Something that many employers are not aware of is that there is more than one path to become a data scientist. Data science, after all, is an umbrella term that encapsulates other fields, such as AI, machine learning and statistics. Since the rise of popularity of data science many people try to promote themselves as a data scientist, whereas in the past they might have used a different term. Also, many people are trying to get the right education. Understanding someone's background better can also help you make better hiring and management decisions.

data scientist, machine learning, statistics, (9 more...)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.33)

#artificialintelligenceJul-25-2021, 17:05:53 GMT

No More "What" Without the "Why"

Throughout the last months, I had the chance to enable various organizations and leaders leveraging their large databases with machine learning. I was particularly engaging with member organisations which struggle with rising dropout rates (churns) -- an issue that became even more serious throughout the pandemic when individual income has been on a declining and the fear of job loss on a rising path. With machine learning, we used very large membership databases with individual-level information (e.g. Machine Learning tells us the "What", Causal Inference the "Why" Despite the overall good performance of the machine learning models, our clients were always interested in one obvious question: Why does an individual member leave? Unfortunately, machine learning models are not suited to identify the causes of things but rather they are built to predict things.

causal inference, causal relationship, causality, (14 more...)

Genre: Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area > Immunology (0.50)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.31)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

#artificialintelligenceOct-13-2020, 08:21:51 GMT

How Large a Sample Do You Need?

And is sample size even the right question? I regularly conduct both qualitative and quantitative research. Regardless of qual or quant I'm often asked about sample size. An economist might balk at the idea that you can get value from 5–10 1 hour interviews. Yet there is a lot of value in qualitative research that can't be achieved with quant. And quantitative research has its own perils -- including sample size issues. Questions about sample size are more complex than they appear. A proper answer requires nuance. It depends on the theoretical justification for the results, the effect size observed, the number of hypothesis, and more.

artificial intelligence, machine learning, sample size, (18 more...)

Genre:

Research Report > New Finding (0.36)
Research Report > Experimental Study (0.31)

Industry:

Health & Medicine (0.48)
Banking & Finance (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.69)

Druckenmiller, Hannah, Hsiang, Solomon

Accounting for Unobservable Heterogeneity in Cross Section Using Spatial First Differences

arXiv.org Machine LearningOct-16-2018

We propose a simple cross-sectional research design to identify causal effects that is robust to unobservable heterogeneity. When many observational units are adjacent, it may be sufficient to regress the "spatial first differences" (SFD) of the outcome on the treatment and omit all covariates. This approach is conceptually similar to first differencing approaches in time-series or panel models, except the index for time is replaced with an index for locations in space. The SFD approach identifies plausibly causal effects so long as local changes in the treatment and unobservable confounders are not systematically correlated between immediately adjacent neighbors. We illustrate how this approach can mitigate omitted variables bias through simulation and by estimating returns to schooling along 10th Avenue in New York and I-90 in Chicago. We then more fully explore the benefits of this approach by estimating effects of climate and soil on maize yields across US counties. In each case, we demonstrate the performance of the research design by withholding important covariates during estimation. SFD has multiple appealing features, such as internal robustness checks that exploit rotation of the coordinate system or double-differencing across space, it is immediately applicable to spatially-gridded data sets, and it can be easily implemented in statical packages by replacing a single index in pre-existing time-series functions.

artificial intelligence, machine learning, research design, (19 more...)

arXiv.org Machine Learning

1810.07216

Country:

North America > United States > New York (0.34)
North America > United States > Illinois > Cook County > Chicago (0.25)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.93)

Industry:

Food & Agriculture > Agriculture (1.00)
Government > Regional Government > North America Government > United States Government (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.68)

#artificialintelligenceDec-29-2017, 09:16:04 GMT

Data Science Research & Development - Internship - Civis Analytics

Are you passionate about model strategy and research design? Do you want to learn from data scientists and have an immediate impact on our work? Civis Analytics is looking for an Data Science Research and Development intern to join our team! Civis Analytics was born on the campaign trail, with CEO Dan Wagner and our founding members spearheading the 2012 Obama for America analytics team. Since then, our DC and Chicago teams have been building software and growing rapidly among a steadily developing client base in education, energy, government, healthcare, media, nonprofits, and politics.

data mining, machine learning, natural language, (12 more...)

Country: North America > United States > Illinois > Cook County > Chicago (0.29)

Industry:

Health & Medicine (0.39)
Government (0.39)
Law (0.35)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.56)
Information Technology > Artificial Intelligence > Natural Language (0.43)
Information Technology > Data Science > Data Mining (0.41)

#artificialintelligenceMay-17-2017, 02:45:21 GMT

Data science through the lens of research design

You are a data scientist or engaged in a data science project in your organization. You have one of the most interesting, influential, and intellectually stimulating jobs on the market. You've mastered stats, machine learning, become a programming wizard, an expert in visualization, a big data evangelist, and a math god. These last three years, our group has lead numerous data science projects across diverse verticals, including ad tech, fin tech, health tech, cloud computing, security, and the telecom industry. Surprisingly, many of our projects share similar attributes despite originating from different domains.

data mining, machine learning, operationalization, (12 more...)

Industry: Information Technology (0.37)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.76)
Information Technology > Communications > Social Media (0.51)
Information Technology > Data Science > Data Mining > Big Data (0.38)